A Unified Coded Deep Neural Network Training Strategy Based on Generalized PolyDot Codes for Matrix Multiplication
This paper has two contributions. First, we propose a novel coded matrix
multiplication technique called Generalized PolyDot codes that advances on
existing methods for coded matrix multiplication under storage and
communication constraints. This technique uses "garbage alignment," i.e.,
aligning computations in coded computing that are not a part of the desired
output. Generalized PolyDot codes bridge between Polynomial codes and MatDot
codes, trading off between recovery threshold and communication costs. Second,
we demonstrate that Generalized PolyDot can be used for training large Deep
Neural Networks (DNNs) on unreliable nodes prone to soft-errors. This requires
us to address three additional challenges: (i) prohibitively large overhead of
coding the weight matrices in each layer of the DNN at each iteration; (ii)
nonlinear operations during training, which are incompatible with linear
coding; and (iii) not assuming presence of an error-free master node, requiring
us to architect a fully decentralized implementation without any "single point
of failure." We allow all primary DNN training steps, namely, matrix
multiplication, nonlinear activation, Hadamard product, and update steps as
well as the encoding/decoding to be error-prone. We consider the case of
mini-batch size $B = 1$, as well as $B > 1$, leveraging coded matrix-vector
products and matrix-matrix products respectively. The problem of DNN training
under soft-errors also motivates an interesting, probabilistic error model
under which a real-number $(P, K)$ MDS code is shown to correct $P - K$ errors
with probability $1$, as compared to $\lfloor (P-K)/2 \rfloor$ for the
more conventional, adversarial error model. We also demonstrate that our
proposed strategy can provide unbounded gains in error tolerance over a
competing replication strategy and a preliminary MDS-code-based strategy for
both these error models.
Comment: Presented in part at the IEEE International Symposium on Information Theory 2018 (Submission Date: Jan 12 2018); Currently under review at the IEEE Transactions on Information Theory
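To make the recovery-threshold idea concrete, below is a minimal NumPy sketch of a MatDot-style code, the communication-efficient extreme of the Polynomial-to-MatDot spectrum that Generalized PolyDot bridges. The split count m = 2, the five workers, and the evaluation points are illustrative choices, not parameters from the paper.

```python
# A minimal MatDot-style sketch (m = 2 inner splits): NOT the paper's full
# Generalized PolyDot construction, just the recovery-threshold idea.
import numpy as np

m = 2                                  # inner-dimension splits
n_workers = 5                          # recovery threshold is 2m - 1 = 3
rng = np.random.default_rng(0)
A = rng.standard_normal((4, 6))
B = rng.standard_normal((6, 4))

A_parts = np.hsplit(A, m)              # A = [A0 A1], split column-wise
B_parts = np.vsplit(B, m)              # B = [B0; B1], split row-wise

# Encode: pA(x) = A0 + A1*x and pB(x) = B0*x + B1, so the coefficient of
# x^(m-1) in pA(x) @ pB(x) is A0 @ B0 + A1 @ B1 = A @ B.
xs = np.arange(1, n_workers + 1, dtype=float)   # distinct evaluation points
worker_out = []
for x in xs:
    Ax = sum(Ai * x**i for i, Ai in enumerate(A_parts))
    Bx = sum(Bj * x**(m - 1 - j) for j, Bj in enumerate(B_parts))
    worker_out.append(Ax @ Bx)         # each worker multiplies coded shares

# Decode from ANY 2m - 1 = 3 surviving workers by interpolating the
# degree-(2m - 2) product polynomial and reading off its x^(m-1) coefficient.
survivors = [0, 2, 4]                  # pretend workers 1 and 3 failed
V = np.vander(xs[survivors], 2 * m - 1, increasing=True)
Y = np.stack([worker_out[i] for i in survivors])
coeffs = np.einsum('ji,ikl->jkl', np.linalg.inv(V), Y)
assert np.allclose(coeffs[m - 1], A @ B)
```

Any 2m - 1 = 3 of the five worker outputs suffice to recover the product, which is the recovery-threshold side of the trade-off the abstract describes.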
Differentially Private Secure Multiplication: Hiding Information in the Rubble of Noise
We consider the problem of private distributed multi-party multiplication. It
is well-established that Shamir secret-sharing coding strategies can enable
perfect information-theoretic privacy in distributed computation via the
celebrated algorithm of Ben Or, Goldwasser and Wigderson (the "BGW algorithm").
However, perfect privacy and accuracy require an honest majority, that is,
$N \geq 2t+1$ compute nodes are required to ensure privacy against any $t$
colluding adversarial nodes. By allowing for some controlled amount of
information leakage and approximate multiplication instead of exact
multiplication, we study coding schemes for the setting where the number of
honest nodes can be a minority, that is, $N < 2t+1$. We develop a tight
characterization of the privacy-accuracy trade-off for cases where $N < 2t+1$ by
measuring information leakage using differential privacy instead of perfect
privacy, and using the mean squared error metric for accuracy. A novel
technical aspect is an intricately layered noise distribution that merges ideas
from differential privacy and Shamir secret-sharing at different layers.
Comment: Extended version of papers presented in IEEE ISIT 2022, IEEE ISIT 2023 and TPDP 202
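As a concrete illustration of the honest-majority barrier discussed above, here is a toy NumPy sketch of BGW-style multiplication with real-valued Shamir shares. It shows only the counting argument (a product of two degree-t share polynomials has degree 2t, so 2t + 1 evaluations are needed); it does not implement the paper's layered-noise scheme, and real-valued Gaussian sharing does not by itself provide the privacy guarantees the abstract analyzes.

```python
# Toy sketch of why BGW multiplication needs N >= 2t + 1 nodes: local share
# multiplication doubles the polynomial degree from t to 2t.
import numpy as np

rng = np.random.default_rng(1)
t = 2                                  # tolerated colluding nodes
N = 2 * t + 1                          # honest-majority node count
a, b = 3.0, 5.0                        # the two private inputs

xs = np.arange(1, N + 1, dtype=float)
pa = np.concatenate(([a], rng.standard_normal(t)))   # secret = constant term
pb = np.concatenate(([b], rng.standard_normal(t)))
shares_a = np.polyval(pa[::-1], xs)    # degree-t share evaluations per node
shares_b = np.polyval(pb[::-1], xs)

shares_ab = shares_a * shares_b        # local multiply: degree-2t evaluations

# Interpolate all N = 2t + 1 points to recover the constant term a*b.
V = np.vander(xs, N, increasing=True)
coeffs = np.linalg.solve(V, shares_ab)
print(coeffs[0])                       # ~15.0 == a * b
```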
Fully-Decentralized Coded Computing for Reliable Large-Scale Computing
In this thesis, I ask the question “how do we compute reliably using thousands of distributed, unreliable nodes?” We propose a system-level solution where we add redundant data across distributed nodes using the technique called “coded computing.” Our main contribution is developing strategies for a masterless, fully-decentralized setting for important computation primitives in machine learning (ML) and scientific computing applications while minimizing the overhead of coding. For distributed matrix multiplication, we make a fundamental advance by proposing coded computing strategies that outperform prior works by an unbounded factor, including recently developed coded computing strategies as well as traditional Algorithm-Based Fault Tolerance (ABFT) strategies. We also propose coded computing schemes for other primitives such as the fast Fourier transform (FFT) and matrix QR factorization. Completing computation reliably and in time under diverse unpredictabilities (e.g., stragglers, node failures, bit flips) is becoming an increasingly important problem. The amount of data we collect is growing exponentially, and recent developments in ML have enabled utilizing and processing such large quantities of data. This has not only led to an increase in the scale of computing but also to the wide popularity of large-scale computing across our society. I will discuss how masterless coded computing can be a more efficient fault-tolerance technique under growing unpredictability in computing systems, providing both theoretical and experimental evidence.
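For context on the ABFT baseline mentioned in the abstract, the following is a small NumPy sketch of classical checksum-based ABFT for matrix multiplication, in which an injected soft error in C = AB is located from row/column checksum mismatches. It is a textbook illustration, not one of the thesis's coded computing schemes.

```python
# Textbook checksum-based ABFT for C = A @ B: a single corrupted entry of C
# is located by the (row, column) pair of failing checksum tests.
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((4, 5))
B = rng.standard_normal((5, 4))

A_ext = np.vstack([A, A.sum(axis=0)])                 # append checksum row
B_ext = np.hstack([B, B.sum(axis=1, keepdims=True)])  # append checksum column
C_ext = A_ext @ B_ext                                 # checksums carry through

C_ext[1, 2] += 7.0                     # inject a soft error into C

# A row (column) fails if its sum disagrees with the checksum column (row).
bad_rows = np.flatnonzero(~np.isclose(C_ext[:-1, :-1].sum(axis=1), C_ext[:-1, -1]))
bad_cols = np.flatnonzero(~np.isclose(C_ext[:-1, :-1].sum(axis=0), C_ext[-1, :-1]))
print(bad_rows, bad_cols)              # [1] [2]: the corrupted entry's location
```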
Fairness without Imputation: A Decision Tree Approach for Fair Prediction with Missing Values
We investigate the fairness concerns of training a machine learning model using data with missing values. Even though there are a number of fairness intervention methods in the literature, most of them require a complete training set as input. In practice, data can have missing values, and missing-data patterns can depend on group attributes (e.g., gender or race). Simply applying off-the-shelf fair learning algorithms to an imputed dataset may lead to an unfair model. In this paper, we first theoretically analyze different sources of discrimination risks when training with an imputed dataset. Then, we propose an integrated approach based on decision trees that does not require a separate process of imputation and learning. Instead, we train a tree with missing incorporated as attribute (MIA), which does not require explicit imputation, and we optimize a fairness-regularized objective function. We demonstrate that our approach outperforms existing fairness intervention methods applied to an imputed dataset, through several experiments on real-world datasets.
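Below is a minimal sketch of an MIA-style split rule of the kind the abstract builds on, assuming binary labels and plain Gini impurity; the fairness regularizer the paper actually optimizes is omitted, and mia_split_score is a hypothetical helper name, not the authors' code.

```python
# MIA-style split scoring: rows with missing feature values are routed to
# whichever child yields the lower weighted Gini impurity.
import numpy as np

def gini(y):
    if len(y) == 0:
        return 0.0
    p = np.bincount(y, minlength=2) / len(y)
    return 1.0 - np.sum(p ** 2)

def mia_split_score(x, y, thr):
    missing = np.isnan(x)
    left = (x <= thr) & ~missing
    right = (x > thr) & ~missing
    best = np.inf
    for send_missing_left in (True, False):
        l = (left | missing) if send_missing_left else left
        r = right if send_missing_left else (right | missing)
        score = (l.sum() * gini(y[l]) + r.sum() * gini(y[r])) / len(y)
        best = min(best, score)        # keep the better routing of missing rows
    return best

x = np.array([0.1, 0.4, np.nan, 0.8, np.nan, 0.9])
y = np.array([0, 0, 0, 1, 1, 1])
print(mia_split_score(x, y, thr=0.5))  # best weighted Gini over both routings
```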